Geometric variational problems of 
statistical mechanics and of 
combinatorics 

Senya Shlosman* 
May 17, 1999

Abstract 

We present geometric solutions of various extremal problems of
statistical mechanics and combinatorics. Together with the Wulff
construction, which predicts the shape of crystals, we discuss the
construction which exhibits the shape of a typical Young diagram
and of a typical skyscraper.

1 Introduction 

1.1 Statistical mechanics 

The variational problems of statistical mechanics we are going to discuss here
are those related to the formation of a droplet or a crystal of one substance
inside another. The question here is: what shape would such a formation
take? The statement that this shape should be defined by the minimum
of the overall surface energy subject to the volume constraint has been known
since time immemorial. In the isotropic case, when the surface tension
does not depend on the orientation of the surface, and so is just a positive
number, the shape in question should of course be spherical (provided we
neglect gravitational effects). In a more general situation the shape in
question is less symmetric. The corresponding variational problem is called
the Wulff problem. Wulff formulated it in his paper [W] of 1901, where he
also presented a geometric solution to it, called the Wulff construction (see
section 2.2 below).

This Wulff construction was regarded in rigorous statistical mechanics
as just a phenomenological statement, though the notion of surface
tension was among its central notions. The situation changed after
the appearance of the book [DKS]. There it was shown that in the setting
of the canonical ensemble formalism, in the regime of the first order phase
transition, the (random) shape occupied by one of the phases has asymptotically
(in the thermodynamic limit) a non-random shape, given precisely
by the Wulff construction! In other words, a typical macroscopic random
droplet looks very close to the Wulff shape. The results of the book [DKS]
are restricted to the 2D Ising ferromagnet at low temperature, though the
methods of the book are suitable for the rigorous treatment of much more
general two-dimensional low-temperature models. Physical intuition is that
as soon as there is phase coexistence, these results should be valid. This was
proven in [T2], [T3] to be the case for the 2D Ising model at all subcritical
temperatures. Some results for the higher dimensional case were obtained in
[Bo], [CeP]. For independent percolation the corresponding results were
obtained in [ACC] for the 2D case, and in [Ce] for the 3D case.



1.2 Combinatorics. 

The main content of the present paper concerns problems arising in
combinatorics, so in this section we describe some of them in more detail.

A partition $p$ of an integer $N$ is a collection of non-negative integers
$n_1 \ge n_2 \ge \dots \ge n_k \ge 0$ such that $\sum_{i=1}^{k} n_i = N$. It can be specified by the
sequence $\{r_k\}$ of integers, with $r_k = l$ iff exactly $l$ elements of $p$ equal $k$. It
can also be described by the monotone function

$$\phi_p(y) = \sum_{k=\lceil y \rceil}^{\infty} r_k .$$

Its graph $G[\phi_p]$ provides a graphical description of $p$ and is called a (2D)
Young diagram.

Similarly, a plane partition $P$ of an integer $N$ is a two-dimensional array of
non-negative integers $n_{ij}$, such that for any $i$ we have $n_{i1} \ge n_{i2} \ge \dots \ge n_{ik} \ge 0$,
for any $j$ we have $n_{1j} \ge n_{2j} \ge \dots \ge n_{lj} \ge 0$, while again $\sum_{i,j} n_{ij} = N$.
One defines the corresponding function $\phi_P(y_1, y_2)$ in the obvious way. The
function $\phi_P(y_1, y_2)$ is monotone in each variable. Its graph $G[\phi_P]$ is called a
3D Young diagram or a skyscraper.
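
As an illustration of the 2D definitions (not part of the original text), the following Python sketch computes the multiplicities $r_k$ and the function $\phi_p$ for a small partition. The function names `multiplicities` and `phi` are hypothetical, chosen here only for the example.

```python
import math
from collections import Counter

def multiplicities(parts):
    """Return {k: r_k}, where r_k is the number of parts equal to k (zeros ignored)."""
    return Counter(k for k in parts if k > 0)

def phi(parts, y):
    """phi_p(y): sum of r_k over k >= ceil(y), i.e. the number of parts >= y."""
    r = multiplicities(parts)
    return sum(count for k, count in r.items() if k >= math.ceil(y))

p = [4, 2, 2, 1]                      # a partition of N = 9
r = multiplicities(p)                 # r_1 = 1, r_2 = 2, r_4 = 1
heights = [phi(p, y) for y in (0.5, 1.5, 2.5, 3.5, 4.5)]

# The area under the graph of phi_p recovers N: phi_p is constant on (k-1, k].
area = sum(phi(p, k) for k in range(1, max(p) + 1))
print(dict(r), heights, area)
```

The step function `phi` is non-increasing, and the area under its graph equals the integer being partitioned, which is why the scaled diagrams considered below have area one.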

Many more objects of a similar type can be defined. For example, one
can put restrictions on how the steps of the stair $G[\phi_p]$ can look: they
cannot be longer than 3 units, and their heights can be only 1, 2 or 5, say. The
same freedom is allowed in 3D, and above.

Let us fix the number $N$, choose the kind of diagrams we are interested
in, and consider the corresponding set $\mathcal{D}_N$ of all these diagrams. There are
finitely many of them, so we can put a uniform probability distribution on
$\mathcal{D}_N$. (Here, again, variations are possible.) The question now is the following:
what does a typical diagram from the family $\mathcal{D}_N$ look like as $N \to \infty$?



The first problem of that type was solved in the paper [VK], see also
[V1], [V2], [DVZ]. It was found there that the typical 2D Young diagram
under the statistics described above, if scaled by the factor $1/\sqrt{N}$, tends to
the curve

$$\exp\left\{-\frac{\pi}{\sqrt{6}}\,x\right\} + \exp\left\{-\frac{\pi}{\sqrt{6}}\,y\right\} = 1. \tag{1}$$

More precisely, for every $\varepsilon > 0$ the probability that the scaled Young diagram
is within distance $\varepsilon$ of the curve (1) goes to 1 as $N \to \infty$.
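
A minimal sketch of how one might explore this limit numerically. It assumes a standard recursive count of partitions by largest allowed part and uses it to draw a uniformly random partition; this is an illustrative procedure, not the method of the cited papers, and the names `partition_table`, `sample_partition` and `limit_height` are hypothetical.

```python
import math
import random

def partition_table(N):
    """count[n][m] = number of partitions of n into parts of size at most m."""
    count = [[0] * (N + 1) for _ in range(N + 1)]
    for m in range(N + 1):
        count[0][m] = 1  # the empty partition
    for n in range(1, N + 1):
        for m in range(1, N + 1):
            # either no part equals m, or one part equal to m is removed
            count[n][m] = count[n][m - 1]
            if n >= m:
                count[n][m] += count[n - m][m]
    return count

def sample_partition(N, count, rng):
    """Draw a partition of N uniformly at random, as a non-increasing list."""
    parts, n, m = [], N, N
    while n > 0:
        if rng.randrange(count[n][m]) < count[n][m - 1]:
            m -= 1            # no further part of size m
        else:
            parts.append(m)   # use one part of size m
            n -= m
    return parts

def limit_height(x):
    """Solve exp(-c x) + exp(-c y) = 1 for y, with c = pi / sqrt(6)."""
    c = math.pi / math.sqrt(6)
    return -math.log(1.0 - math.exp(-c * x)) / c

N = 400
table = partition_table(N)
p = sample_partition(N, table, random.Random(0))
scale = math.sqrt(N)
for x in (0.5, 1.0, 1.5):
    # scaled diagram height: number of parts >= x*sqrt(N), divided by sqrt(N)
    height = sum(1 for k in p if k >= x * scale) / scale
    print(x, round(height, 3), round(limit_height(x), 3))
```

For a single sample at moderate $N$ the two printed columns should only be roughly comparable; the statement above concerns the limit $N \to \infty$.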
The heuristic way to obtain (1) (and similar results) is the following:

i) Let $A = (a_1, a_2)$, $B = (b_1, b_2)$ be two points in $\mathbb{Z}^2$, with $a_1 < b_1$, $a_2 > b_2$.
We can easily see that the number $\#(A,B)$ of lattice staircases, starting from
$A$, terminating at $B$, and allowed to go only to the right or down, is given by
$\binom{(b_1 - a_1) + (a_2 - b_2)}{b_1 - a_1}$. Therefore one concludes by using the Stirling formula that

$$\lim_{|B-A| \to \infty} \frac{1}{|B-A|} \ln \#(A,B) = h(n_{AB}). \tag{2}$$

Here $n_{AB}$ is the unit vector normal to the segment $[A,B]$, and for $n =
(n_1, n_2)$, $\alpha = \frac{n_1}{n_1 + n_2}$, the entropy function is
$h(n) = -(n_1 + n_2)\left(\alpha \ln \alpha + (1-\alpha)\ln(1-\alpha)\right)$.

ii) One argues that the number of Young diagrams of area $N$ which, after
scaling by $1/\sqrt{N}$, "go along" the monotone curve $y = c(x) \ge 0$ with integral one,
is approximately given by

$$\exp\left\{\sqrt{N}\int h\left(\frac{-c'(x)}{\sqrt{1+(c'(x))^2}},\ \frac{1}{\sqrt{1+(c'(x))^2}}\right)\sqrt{1+(c'(x))^2}\,dx\right\}. \tag{3}$$

iii) Assuming that indeed the model under consideration exhibits under a
proper scaling some typical behavior, described by a nice smooth non-random
curve (or surface) $C$, one comes to the conclusion that the curve $C$ should be
such that the integral in (3), computed along $C$, is maximal compared with
all other allowed curves.
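
Step i) of the heuristics can be checked numerically. The sketch below assumes that $|B-A|$ in (2) is the Euclidean length, in which case the entropy function carries the factor $n_1 + n_2$; all function names are hypothetical and serve only this illustration.

```python
import math

def entropy_h(n1, n2):
    """h(n) = -(n1 + n2) * (a ln a + (1 - a) ln(1 - a)), with a = n1 / (n1 + n2)."""
    a = n1 / (n1 + n2)
    return -(n1 + n2) * (a * math.log(a) + (1 - a) * math.log(1 - a))

def log_staircases(dx, dy):
    """ln of binom(dx + dy, dx): staircases with dx steps right and dy steps down."""
    return math.lgamma(dx + dy + 1) - math.lgamma(dx + 1) - math.lgamma(dy + 1)

def unit_normal(dx, dy):
    """Unit normal, in the positive quadrant, to the segment with increments (dx, -dy)."""
    length = math.hypot(dx, dy)
    return dy / length, dx / length

def normalized_log_count(dx, dy):
    """The left-hand side of (2) before taking the limit."""
    return log_staircases(dx, dy) / math.hypot(dx, dy)

for scale in (10, 1000, 100000):
    dx, dy = 3 * scale, scale
    print(scale, normalized_log_count(dx, dy), entropy_h(*unit_normal(dx, dy)))
```

As the scale grows with the direction of the segment kept fixed, the first printed value approaches the second; the discrepancy is of logarithmic order in the length, divided by the length.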

In the general case one is not able to write down the corresponding entropy
function precisely. The only information generally available is the existence
of a limit of the type of (2), by a subadditivity argument. It should be
stressed that even when the variational problem for the model is known, the
main difficulty of the rigorous treatment of the model is the proof that
it does indeed exhibit a nontrivial behavior after a proper scaling.

The above program was realized in [V1], [V2], see also [DVZ], for the 2D
case described above and for some other cases. In [B] a class of more general
2D problems was studied. The first 3D problem was successfully studied in
[CKP]. The method of the last paper can also solve the skyscraper problem,
as is claimed in [Ke].

When compared with the situation in statistical mechanics, the combinatorial
program and its development look very similar. The only difference
is that the counterpart of the Wulff construction was not designed in
combinatorics, probably because there was no heuristic period there. In this note
we fill this gap by presenting such a construction. It provides,
like the Wulff one, the geometric solution to the corresponding variational
problem under minimal restrictions on the initial data, and also proves the
uniqueness of the solution.

In the next section we first recall the Wulff minimizing
problem (sect. 2.1) and the Wulff construction (sect. 2.2), which solves
this problem, and then present the corresponding maximizing problem of
combinatorics (sect. 2.3) and the geometric construction for its solution
(sect. 2.4), which is our main result. We give the proof in section 3.



2 Statement of results

2.1 Wulff minimizing problem.

Let $S^d \subset \mathbb{R}^{d+1}$ denote the unit sphere, and let a real function $\tau$ on $S^d$ be
given. We suppose that the function is continuous, positive: $\tau(\cdot) \ge \mathrm{const} > 0$,
and even: $\tau(n) = \tau(-n)$. Then for every hypersurface $M^d \subset \mathbb{R}^{d+1}$ we
can define the Wulff functional

$$\mathcal{W}_\tau(M^d) = \int_{M^d} \tau(n_x)\, ds_x. \tag{4}$$

Here $x \in M^d$ is a point on the manifold $M^d$, the vector $n_x$ is the unit
vector parallel to the normal to $M^d$ at $x$, and $ds$ is the usual volume $d$-form
on $M^d$, induced from the Riemannian metric on $\mathbb{R}^{d+1}$ by the embedding
$M^d \subset \mathbb{R}^{d+1}$. Of course, we need to assume that the normal to $M^d$ is defined
almost everywhere, i.e. that $M^d$ is smooth enough. Let now $D_q$ be the
collection of all closed hypersurfaces $M^d$, embedded in $\mathbb{R}^{d+1}$, and such that the
volume $\mathrm{vol}(M^d)$ inside $M^d$ equals $q$. The Wulff problem consists in finding
the lower bound of $\mathcal{W}_\tau$ over $D_1$:

$$w_\tau = \inf_{M \in D_1} \mathcal{W}_\tau(M), \tag{5}$$

as well as the minimizing surface(s) $W_\tau$, such that $\mathcal{W}_\tau(W_\tau) = w_\tau$, if they exist.
It turns out that the above variational problem can indeed be solved. It has
a unique solution, which is given by the following

2.2 Wulff construction ([W]).

The minimizer $W_\tau$ can be obtained as follows. For every $n \in S^d$, $\lambda > 0$
define the half-space

$$L_<(n;\lambda) = \{x \in \mathbb{R}^{d+1} : (x,n) \le \lambda\,\tau(n)\}, \tag{6}$$

and let

$$K_<(\lambda) = \bigcap_{n \in S^d} L_<(n;\lambda), \tag{7}$$

$$M_\tau(\lambda) = \partial(K_<(\lambda)). \tag{8}$$

The bodies $K_<(\lambda)$ are called Wulff bodies. We define $\lambda_1$ as the value of $\lambda$
for which $\mathrm{vol}(M_\tau(\lambda)) = 1$. Then we define $W_\tau = M_\tau(\lambda_1)$. The surface $W_\tau$
is called the Wulff shape. This is the minimizer we are looking for.

The paper [T2] contains a simple proof that $\mathcal{W}_\tau(W_\tau) \le \mathcal{W}_\tau(M)$ for every
$M \in D_1$. The uniqueness of the minimizing surface is proven in [T1]. It is
known that in dimension 2 the minimizing surface $W_\tau$ of the functional $\mathcal{W}_\tau$
is not only unique, but is also stable in the Hausdorff metric; for the proof,
see [DKS], Sect. 2.4.
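
The construction (6)-(8) can be sketched numerically in dimension 2. The code below is an illustration under stated assumptions, not the author's procedure: the intersection over all normals is replaced by a finite sample of directions, and the names `wulff_radius`, `wulff_area`, `isotropic` and `square_tau` are hypothetical. It uses the fact that in 2D the enclosed area scales as $\lambda^2$.

```python
import math

def wulff_radius(tau, phi, lam=1.0, samples=720):
    """Distance from the origin to the boundary of K_<(lam) in direction phi (2D).

    A point r*u belongs to every half-plane (x, n) <= lam * tau(n) exactly when
    r <= lam * tau(n) / (u, n) for all unit vectors n with (u, n) > 0.
    The intersection over all n is approximated by a finite sample of normals.
    """
    ux, uy = math.cos(phi), math.sin(phi)
    radius = math.inf
    for k in range(samples):
        t = 2.0 * math.pi * k / samples
        nx, ny = math.cos(t), math.sin(t)
        dot = ux * nx + uy * ny
        if dot > 1e-9:
            radius = min(radius, lam * tau(nx, ny) / dot)
    return radius

def wulff_area(tau, lam=1.0, directions=360):
    """Area of K_<(lam) via the polar formula 0.5 * integral of r(phi)^2 dphi."""
    step = 2.0 * math.pi / directions
    return 0.5 * step * sum(wulff_radius(tau, i * step, lam) ** 2
                            for i in range(directions))

def isotropic(nx, ny):
    return 1.0                      # K_<(1) is the unit disc

def square_tau(nx, ny):
    return abs(nx) + abs(ny)        # K_<(1) is the square [-1, 1]^2

area = wulff_area(square_tau)
# The area scales as lam^2, so lam_1 = 1 / sqrt(area at lam = 1).
lam_1 = 1.0 / math.sqrt(area)
print(area, lam_1, wulff_radius(isotropic, 0.3))
```

For `square_tau` the Wulff body at $\lambda = 1$ is the square $[-1,1]^2$ of area 4, so the unit-area Wulff shape corresponds to $\lambda_1 = 1/2$; the sketch recovers this up to discretization error.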



2.3 Maximizing problem. 

In a dual problem we again have a function $\eta$ of a unit vector, but this time it
is defined only over the subset $\Delta^d = S^d \cap \mathbb{R}^{d+1}_+$ of them, lying in the positive
octant. We suppose again that the function is continuous and nonnegative:
$\eta(\cdot) \ge 0$. We assume additionally that

$$\eta(n) \to 0 \ \text{uniformly as}\ n \to \partial\Delta^d. \tag{9}$$

Let now $G \subset \mathbb{R}^{d+1}_+$ be an embedded hypersurface. We assume that for almost
every $x \in G$ the normal vector $n_x$ is defined, and moreover

$$n_x \in \Delta^d \ \text{for a.e.}\ x \in G. \tag{10}$$

Then we can define the functional

$$\mathcal{V}_\eta(G) = \int_G \eta(n_x)\, ds_x. \tag{11}$$

In analogy with section 2.1 we introduce the families $D_q$, $q > 0$, of such
surfaces $G$ as follows:
$G \in D_q$ iff

i) $G$ splits the octant $\mathbb{R}^{d+1}_+$ into two parts, with the boundary $\partial\mathbb{R}^{d+1}_+$
belonging to one of them,

ii) the ($(d+1)$-dimensional) volume of the body $Q(G)$, enclosed between
$\partial\mathbb{R}^{d+1}_+$ and $G$, equals $q$. In what follows we denote the volume of $Q(G)$ by
$\mathrm{vol}(G)$.

For example, let $f(y) \ge 0$ be a function on $\mathbb{R}^d_+$, non-increasing in each of the $d$
variables, and let $G[f] \subset \mathbb{R}^{d+1}_+$ be its graph. Then

$$\mathrm{vol}(G[f]) = \int_{\mathbb{R}^d_+} f(y)\, dy, \tag{12}$$

so if $\int_{\mathbb{R}^d_+} f(y)\, dy = q$, then $G[f]$ is an element of $D_q$, provided the function
$f$ is sufficiently smooth.

Our problem now is to find the upper bound of $\mathcal{V}_\eta$ over $D_1$:

$$v_\eta = \sup_{G \in D_1} \mathcal{V}_\eta(G), \tag{13}$$

as well as the maximizing surface(s) $V_\eta \in D_1$, such that $\mathcal{V}_\eta(V_\eta) = v_\eta$, if
possible. Note that the last problem differs crucially from (5), since here
we are looking for the supremum. In particular, this upper bound evidently
diverges if taken over all surfaces, and not only over the "monotone" ones, in the
sense of (10), unlike in the problem (5).

It turns out that there exists a geometric construction which provides
a solution to the variational problem (13), in the same way as the Wulff
construction solves the problem (5).



2.4 The main result. 

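For every $n \in \Delta^d$, $\lambda > 0$ define the half-space

$$L_>(n;\lambda) = \{x \in \mathbb{R}^{d+1} : (x,n) \ge \lambda\,\eta(n)\}, \tag{14}$$

and let

$$K_>(\lambda) = \bigcap_{n \in \Delta^d} L_>(n;\lambda), \tag{15}$$

$$G_\eta(\lambda) = \partial(K_>(\lambda)). \tag{16}$$

Because of (9), the surfaces $G_\eta(\lambda)$ are graphs of functions $f_\eta^\lambda(y)$, $y \in \mathbb{R}^d_+$,
i.e. $G_\eta(\lambda) = G[f_\eta^\lambda]$.

Theorem 1 Suppose the integrals $\mathrm{vol}(G_\eta(\lambda))$ (see (12)) are converging.
Then the functional $\mathcal{V}_\eta$ has a unique maximizer, $V_\eta$, over the set $D_1$. It is
given by the above construction (16):

$$V_\eta = G_\eta(\lambda_1),$$

where $\lambda_1$ satisfies $\mathrm{vol}(G_\eta(\lambda_1)) = 1$, and the maximum of the functional is
$v_\eta = \mathcal{V}_\eta(V_\eta)$ (see (13)). If the integrals $\mathrm{vol}(G_\eta(\lambda))$ diverge, then $v_\eta = \infty$.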

As we already said in the introduction, in all known cases the heuristic
arguments of Section 1.2 turn out to be correct, and are validated by
the corresponding (sometimes quite hard) theorems. For example, they are
valid for the problem of finding the asymptotic shape of the Young diagram,
described in Section 1.2, as was proven in [VK], [V1], [V2]. Therefore, the
following statement holds:

Corollary 2 In the notation of the theorem above, the curve
$\exp\left\{-\frac{\pi}{\sqrt{6}}\,x\right\} + \exp\left\{-\frac{\pi}{\sqrt{6}}\,y\right\} = 1$
from the formula (1) coincides with the curve $G_\eta(\lambda_1)$,
given by our construction applied to the function $\eta(n) = h(n)$ from the
formula (2).

Of course, this statement can also be easily checked directly.
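
One possible direct check is numerical. The sketch below assumes the entropy function in the form $h(n) = -(n_1+n_2)(\alpha\ln\alpha + (1-\alpha)\ln(1-\alpha))$, $\alpha = n_1/(n_1+n_2)$, for unit vectors $n$, and approximates the intersection of half-planes by a finite sample of normals. The value $\lambda_1 = \sqrt{6}/\pi$ is the one for which the area under the curve $e^{-x/\lambda} + e^{-y/\lambda} = 1$, equal to $\lambda^2\pi^2/6$, is one. The function names are hypothetical.

```python
import math

def h(n1, n2):
    """Entropy function of a unit vector n = (n1, n2) in the open positive quadrant."""
    a = n1 / (n1 + n2)
    return -(n1 + n2) * (a * math.log(a) + (1 - a) * math.log(1 - a))

def constructed_height(x, lam, samples=20000):
    """Height at abscissa x of the boundary of K_>(lam) for eta = h (2D case).

    The point (x, y) lies in every half-plane x*n1 + y*n2 >= lam * h(n) exactly when
    y >= (lam * h(n) - x * n1) / n2 for all unit n in the open positive quadrant,
    so the boundary height is the supremum of the right-hand side over n.
    """
    best = -math.inf
    for k in range(1, samples):
        t = 0.5 * math.pi * k / samples
        n1, n2 = math.cos(t), math.sin(t)
        best = max(best, (lam * h(n1, n2) - x * n1) / n2)
    return best

def vershik_height(x):
    """Solve exp(-c x) + exp(-c y) = 1 for y, with c = pi / sqrt(6)."""
    c = math.pi / math.sqrt(6)
    return -math.log(1.0 - math.exp(-c * x)) / c

lam_1 = math.sqrt(6) / math.pi
for x in (0.25, 0.5, 1.0, 2.0):
    print(x, constructed_height(x, lam_1), vershik_height(x))
```

The two printed columns should agree up to the discretization error of the sampled normals.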



3 The proof of the Theorem. 

We start with the case of finite volumes: $\mathrm{vol}(G_\eta(\lambda)) < \infty$ for all $\lambda$.

We will prove our theorem by showing that for any surface $G \in D_1$, $G \ne V_\eta$,
which coincides with $V_\eta$ outside some big ball around the origin of
$\mathbb{R}^{d+1}$, we have

$$\mathcal{V}_\eta(G) < \mathcal{V}_\eta(V_\eta).$$

First, we need more detailed notation than in the previous section. For
every $x \in \mathbb{R}^{d+1}$, $n \in S^d$, $\kappa \ge 0$ we define the half-spaces

$$L_>(x,n;\kappa) = \{y \in \mathbb{R}^{d+1} : (y - x, n) \ge \kappa\}$$

and the planes

$$L_=(x,n;\kappa) = \{y \in \mathbb{R}^{d+1} : (y - x, n) = \kappa\}.$$

Let $C \subset \mathbb{R}^{d+1}$ be a convex set, and $x \in C$. The support function $\tau_{x,C}(\cdot)$ is
defined by

$$\tau_{x,C}(n) = \inf\{\kappa : L_>(x,n;\kappa) \cap C = \emptyset\};$$

we put $\tau_{x,C}(n) = \infty$ if $L_>(x,n;\kappa) \cap C \ne \emptyset$ for all $\kappa$. We denote by $K$ the
convex set $K_>(\lambda = \lambda_1)$, introduced in (15), and we use the notation $G$ for
the surface $\partial(K)$.

Let $\varepsilon > 0$. Introduce the set $\mathbb{R}^{d+1}_\varepsilon = \{y = (y_1, \dots, y_{d+1}) \in \mathbb{R}^{d+1}_+ : y_i \ge \varepsilon\}$,
and define $K(\varepsilon) = K \cap \mathbb{R}^{d+1}_\varepsilon$, $G(\varepsilon) = \partial(K(\varepsilon))$. The family of the subsets
$\hat G(\varepsilon) = (G(\varepsilon) \cap G) \subset G$ is increasing, with $\bigcup_{\varepsilon > 0} \hat G(\varepsilon) = G$.

Let $N = N(\varepsilon)$ be so big that the cube $B_N = \{y = (y_1, \dots, y_{d+1}) \in \mathbb{R}^{d+1} : 0 \le y_i \le N\}$
contains the set $\hat G(\varepsilon)$. We denote by $x_N$ the vertex $(N, \dots, N)$ of this cube.
Consider the convex set $U = B_N \cap K(\varepsilon)$. We are going to define with its help
a function $T_N(n) = T_{N,\varepsilon}(n)$ on $S^d$. First, let $n \in -(\Delta^d)$; in other words, $n$
has all coordinates non-positive. Note that by definition the support plane
$L_=(x_N, n; \tau_{x_N,U}(n))$ intersects the set $G(\varepsilon)$. In case when this intersection
contains "inner" points of $\hat G(\varepsilon)$, i.e. points not in $\partial\hat G(\varepsilon) = G \cap \partial\mathbb{R}^{d+1}_\varepsilon$, we
put

$$T_N(n) = -(x_N, n) - \eta(-n) > 0, \tag{17}$$

where $\eta$ is our initial function (9). We use the same definition (17) for the
remaining $n$-s in $-(\Delta^d)$ for which the intersection

$$L_=(0, -n; \eta(-n)) \cap \hat G(\varepsilon) \ne \emptyset.$$

For future use we denote the set of $n$-s where the function $T_N$ is already
defined by $-(\Delta^d_\varepsilon)$; note that $\Delta^d_\varepsilon \to \Delta^d$ as $\varepsilon \to 0$. For the remaining
$n \in -(\Delta^d \setminus \Delta^d_\varepsilon)$ we define $T_N(n) = \tau_{x_N,U}(n)$. For $n$-s in $S^d \setminus (-\Delta^d)$ the function
$T_N(n)$ is defined by applying multiple reflections in the coordinate planes.
In other words, the values $T_N(\pm n_1, \pm n_2, \dots, \pm n_{d+1})$ do not depend on the
choice of signs. Analogously, we define the convex set $\tilde U$ as the union of $U$
and all its multiple reflections in the coordinate planes shifted by $x_N$.

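It follows from the definitions above that the set $\tilde U$ (the union of $U$ and its
reflections) is nothing else but the shift by the vector $x_N$ of the Wulff body
$K_<(1)$ built from the function $T_N$. According to what was
said in section 2.2, for every $M \in D_{\mathrm{vol}(\partial\tilde U)}$, $M \ne \partial\tilde U$,

$$\mathcal{W}_{T_N}(\partial\tilde U) < \mathcal{W}_{T_N}(M). \tag{18}$$

Consider now an arbitrary hypersurface $H$ such that $\partial H = \partial\hat G(\varepsilon)$, where
$\hat G(\varepsilon) = G(\varepsilon) \cap G$, while the
set $\nu(H)$ of its unit normal vectors belongs to the subset $\Delta^d_\varepsilon \subset \Delta^d$ (which is
the case for the surface $\hat G(\varepsilon)$ itself). Then for any such $H$

$$\mathcal{V}_\eta(H) + \mathcal{W}_{T_N}(H) = N\sqrt{d+1}\;\mathrm{vol}\left(\pi(\partial\hat G(\varepsilon))\right), \tag{19}$$

where $\pi(\partial\hat G(\varepsilon))$ is the projection of the "curve" $\partial\hat G(\varepsilon)$ (of codimension 2) on
the hyperplane $\{y : y_1 + \dots + y_{d+1} = 0\} \subset \mathbb{R}^{d+1}$, and where $\mathrm{vol}(\pi(\partial\hat G(\varepsilon)))$
is the ($d$-dimensional) volume inside it. The relation (19) follows from
(17). Therefore the minimality property (18) of the functional $\mathcal{W}_{T_N}$ on the
surface $\hat G(\varepsilon)$ implies the maximality property of the functional $\mathcal{V}_\eta$ on the
same surface!

The uniqueness statement for $\mathcal{V}_\eta$ is therefore a corollary of the uniqueness
for $\mathcal{W}$.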
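
The mechanism behind relation (19) can be illustrated in the 2D case ($d = 1$). The sketch assumes that for normals $n$ in the positive quadrant the function $T_N$ takes the form $T_N(n) = (x_N, n) - \eta(n)$, which is what (17) gives after the reflection $n \to -n$, and it uses an arbitrary test function $\eta$ vanishing on the coordinate directions. The polylines and all names are hypothetical. The sum of the two functionals then depends only on the endpoints, so a curve with a larger value of the first functional has a smaller value of the second.

```python
import math

def edge_data(points):
    """Yield (length, unit normal in the positive quadrant) for each edge of a
    polyline whose x-coordinates increase and whose y-coordinates decrease."""
    for (xa, ya), (xb, yb) in zip(points, points[1:]):
        dx, dy = xb - xa, ya - yb
        length = math.hypot(dx, dy)
        yield length, (dy / length, dx / length)

def V(points, eta):
    """Discrete version of the functional (11): sum of eta(n) * length over edges."""
    return sum(length * eta(n) for length, n in edge_data(points))

def W(points, eta, N):
    """Discrete Wulff functional with T_N(n) = (x_N, n) - eta(n), x_N = (N, N)."""
    return sum(length * (N * (n[0] + n[1]) - eta(n))
               for length, n in edge_data(points))

def eta(n):
    return n[0] * n[1]   # an arbitrary test function, zero on the axis directions

N = 10.0
H1 = [(0.0, 2.0), (3.0, 0.0)]                 # straight segment
H2 = [(0.0, 2.0), (1.0, 0.5), (3.0, 0.0)]     # same endpoints, different shape
for H in (H1, H2):
    print(V(H, eta), W(H, eta, N), V(H, eta) + W(H, eta, N))
```

For both polylines the sum equals $N$ times the total horizontal plus vertical displacement, here $10 \cdot (3 + 2) = 50$, in agreement with the right-hand side of (19) for $d = 1$.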

It remains now to consider the case when the volumes $\mathrm{vol}(G_\eta(\lambda))$
are infinite for all $\lambda$. We are going to show that in that case $v_\eta = \infty$. To make
things look simpler, we restrict ourselves to the 2D case. Let $G \subset G_\eta(1)$ be
an arc, and consider the "triangle" $\Delta(G) \subset \mathbb{R}^2$, made from all the points of
all the segments joining the origin to the curve $G$:

$$\Delta(G) = \bigcup_{x \in G} [0, x].$$

It is straightforward to see that

$$\mathrm{vol}(\Delta(G)) = \tfrac{1}{2}\,\mathcal{V}_\eta(G).$$

We will now present a family $G_\gamma \in D_1$ such that $\mathcal{V}_\eta(G_\gamma) \to \infty$ as $\gamma \to 0$.
Namely, for every $\lambda$ we define the number $N(\lambda)$ to be the size of the square
$B(\lambda) = \{y \in \mathbb{R}^2 : 0 \le y_i \le N(\lambda)\}$ for which $\mathrm{vol}(Q(G_\eta(\lambda)) \cap B(\lambda)) = 1$,
and we put $G_\gamma$ to be the part of the boundary of the intersection $Q(G_\eta(\gamma)) \cap B(\gamma)$
which is visible from the point $(2N(\gamma), 2N(\gamma))$, say. The curve $G_\gamma$
consists of a certain arc $\hat G_\gamma$ of the curve $G_\eta(\gamma)$ and two small segments,
joining its endpoints to the coordinate axes. By construction, $\mathrm{vol}(\Delta(\hat G_\gamma)) \ge \tfrac{1}{2}$.
On the other hand, $\mathrm{vol}(\Delta(\hat G_\gamma)) = \tfrac{\gamma}{2}\,\mathcal{V}_\eta(\hat G_\gamma)$, which implies that

$$\mathcal{V}_\eta(G_\gamma) \ge \mathcal{V}_\eta(\hat G_\gamma) \ge \frac{1}{\gamma}.$$



4 Conclusion 

In this paper we have described an explicit geometric construction which
predicts the asymptotic shape of some combinatorial objects. It is worth
mentioning that the method presented should work whenever the underlying
probability measure has a certain locality property, namely that distant
portions of the combinatorial object under consideration are weakly dependent.
This locality property is in fact the key feature behind the results
obtained in the papers cited above. It also holds for the corresponding problems
of statistical mechanics, like the validity of the Wulff construction, and
is crucial there as well.